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Abstract. We survey recent results on the topological complexity of context-free 
^-languages which form the second level of the Chomsky hierarchy of languages 
of infinite words. In particular, we consider the Borel hierarchy and the Wadge hi- 
erarchy of non-deterministic or deterministic context-free ^-languages. We study 
also decision problems, the links with the notions of ambiguity and of degrees of 
ambiguity, and the special case of ^-powers. 
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1 Introduction 

The Chomsky hierarchy of formal languages of finite words over a finite alphabet is 
now well known, IHU79I . The class of regular languages accepted by finite automata 
forms the first level of this hierarchy and the class of context-free languages accepted 
by pushdown automata or generated by context-free grammars forms its second level 
|ABB96|. The third and the fourth levels are formed by the class of context-sensitive 
languages accepted by linear-bounded automata or generated by Type-1 grammars and 
the class of recursively enumerable languages accepted by Turing machines or gener- 
ated by Type-0 grammars [Cho56|. In particular, context-free languages, firstly intro- 
duced by Chomsky to analyse the syntax of natural languages, have been very useful 
in Computer Science, in particular in the domain of programming languages, for the 
construction of compilers used to verify correctness of programs, IH MUOll . 

There is a hierarchy of languages of infinite words which is analogous to the Chom- 
sky hierarchy but where the languages are formed by infinite words over a finite al- 
phabet. The first level of this hierarchy is formed by the class of regular w-languages 



** UMR 5668 - CNRS - ENS Lyon - UCB Lyon - INRIA 
LIP Research Report RR 2008-17 



accepted by finite automata. They were first studied by Biichi in order to study deci- 
sion probems for logical theories. In particular, Biichi proved that the monadic sec- 
ond order theory of one successor over the integers is decidable, using finite automata 
equipped with a certain acceptance condition for infinite words, now called the Biichi 
acceptance condition. Well known pioneers in this reasearch area are named Muller, 
Mc Naughton, Rabin, Landweber, Choueka, QMul63|Nau66|Rab69|Lan69ICho74|| . The 
theory of regular w-languages is now well established and has found many applications 
for specification and verification of non-terminating systems; see [Tho90 Sta97a PP04| 
for many results and references. The second level of the hierarchy is formed by the class 
of context-free w-languages. As in the case of languages of finite words it turned out that 
an w-language is accepted by a (non-deterministic) pushdown automaton (with Biichi 
acceptance condition) if and only if it is generated by a context-free grammar where 
infinite derivations are considered. Context-free languages of infinite words were first 
studied by Cohen and Gold, MCG77ICG78al . Linna, HLin75ILin76ILin771 . Boasson, Ni- 
vat, ONiv78INiv77IBoa79IBN80l . Beauquier, |Bea841 . see the survey |Sta97al . Notice 
that in the case of infinite words Type-1 grammars and Type-0 grammars accept the 
same w-languages which are also the w-languages accepted by Turing machines with a 
Biichi acceptance condition [CG78b Sta97a|, see also the fundamental study of Engel- 
friet and Hoogeboom on X-automata, i.e. finite automata equipped with a storage type 
X, accepting infinite words, IE H93L 

Context-free w-languages have occurred recently in the works on games played on infi- 
nite pushdown graphs, following the fundamental study of Walukiewicz, [ WalOO Tho02 1 
nSer04blFin05cl . 

Since the set £ u of infinite words over a finite alphabet £ is naturally equipped with 
the Cantor topology, a way to study the complexity of a>languages is to study their 
topological complexity. The first task is to locate u>-languages with regard to the Borel 
and the projective hierarchies, and next to the Wadge hierarchy which is a great refine- 
ment of the Borel hierarchy. It is then natural to ask for decidability properties and to 
study decision problems like : is there an effective procedure to determine the Borel 
rank or the Wadge degree of any context-free w-language ? Such questions were posed 
by Lescow and Thomas in MLT94I . In this paper we survey some recent results on the 
topological complexity of context-free w-languages. Some of them were very surprising 
as the two following ones: 

1. there is a 1-counter finitary language L such that L u is analytic but not Borel, 
0FinO3bl . 

2. The Wadge hierarchy, hence also the Borel hierarchy, of w-languages accepted 
by real time 1-counter Biichi automata is the same as the Wadge hierarchy of ui- 
languages accepted by Biichi Turing machines, [Fin06a|. 

The Borel and Wadge hierarchies of non deterministic context-free w-languages are not 
effective. One can neither decide whether a given context-free w-language is a Borel 
set nor whether it is in a given Borel class YP a or n" . On the other hand deterministic 
context-free oj-languages are located at a low level of the Borel hierarchy: they are all 
A§-sets. They enjoy some decidability properties although some important questions in 
this area are still open. We consider also the links with the notions of ambiguity and of 



degrees of ambiguity, and the special case of a>powers, i.e. of w-languages in the form 
V u , where V is a (context-free) finitary language. Finally we state some perspectives 
and give a list of some questions which remain open for further study. 

The paper is organized as follows. In Section 2 we recall the notions of context-free 
w-languages accepted by Biichi or Muller pushdown automata. Topological notions 
and Borel and Wadge hierarchies are recalled in Section 3. In Section 4 is studied the 
case of non-deterministic context-free w-languages while deterministic context-free u>- 
languages are considered in Section 5. Links with notions of ambiguity in context free 
languages are studied in Section 6. Section 7 is devoted to the special case of w-powers. 
Perspectives and some open questions are presented in last Section 8. 

2 Context-free ^-languages 

We assume the reader to be familiar with the theory of formal (oj)-languages [ Tho90 Sta97al . 
We shall use usual notations of formal language theory. 

When E is a finite alphabet, a non-empty finite word over E is any sequence x = 

di . . ■ <2fc, where <ij G E for i = 1, . . . , k , and k is an integer > 1. The length of x is k, 

denoted by \x\. The empty word has no letters and is denoted by A; its length is 0. For 

x = a,\. .. Uk, we write x(i) = cij and x[i] = x(l) . . . x(i) for i < k and x[0] = A. E* 

is the set of finite words (including the empty word) over E. 

For V C E*, the complement of V (in E*) is E* - V denoted V~. 

The first infinite ordinal is lj. An co-word over E is an lo -sequence ax . . . a n . . ., 

where for all integers i > 1, ai £ E. When a is an w-word over E, we write 

a = <r(l)cr(2) . . . cr(n) . . ., where for all i, a(i) G E, and a[n] = <j(l)a(2) . . . cr(n) 

for all n > 1 andcr[0] = A. 

The usual concatenation product of two finite words u and v is denoted u.v (and some- 
times just uv). This product is extended to the product of a finite word u and an cj-word 
v: the infinite word u.v is then the w-word such that: 
(u.v)(k) = u(k) if k < \u\, and (u.v)(k) = v(k — if k > \u\. 
The prefix relation is denoted C: a finite word u is a prefix of a finite word v (respec- 
tively, an infinite word v), denoted u C v, if and only if there exists a finite word w 
(respectively, an infinite word w), such that v — u.w. The set of u-words over the 
alphabet E is denoted by E u . An ui-language over an alphabet E is a subset of E u . 
The complement (in E^) of an w-language V C E u is Z 1 " — 1/, denoted V~ . 
For 1/ C E*, the w-power of V is : 

= {a = mi . . . u n ■ ■ ■ G E^ | Mi > 1 u,e V}. 

We now define pushdown machines and the class of cj-context-free languages. 

Definition 1. A pushdown machine (PDM) is a 6-tuple M = (K, E, r, S, qo, Zq), 
where K is a finite set of states, E is a finite input alphabet, r is a finite pushdown 
alphabet, qo G K is the initial state, Zq G r is the start symbol, and 5 is a mapping 
from K x (17 U {A}) x r to finite subsets of K x r* . 

//7 6 r + describes the pushdown store content, the leftmost symbol will be assumed 



to be on "top " of the store. A configuration of a PDM is a pair (g, 7) where q £ K and 

7 er. 

For a G £ U {A}, /3, 7 G F* and Z € T, if (p, (3) is in <5(g, a, Z), then we write 
a : (g, Zj) 1 *m (p, Pi)- 

>-^* M is the transitive and reflexive closure of *—>m- (The subscript M will be omitted 
whenever the meaning remains clear). 

Let a = a\a 2 . . . a n . . . be an u-word over S. an infinite sequence of configurations 
r = (lii 7i)i>l i s called a complete run of M on a, starting in configuration (p, 7), iff: 

1- (?i,7i) = (p,7) 

2. for each i > 1, f/zere e^/ifi fei G X 1 U {A} satisfying hi : (qi,ji) (<Z;+i,7i+i) 
smc/z f/zaf aia,2 . . . a n . . . — 61&2 • • • b n . . . 

For every such run, In(r) is the set of all states entered infinitely often during run r. 
A complete run r of M on a , starting in configuration (go, Zq), will be simply called 
"a run of M on a". 

Definition 2. A Bilchi pushdown automaton is a 7 -tuple M = (K, S, _T, 5, qo, Zq, F) 
where M' — (K, S, F, 5, go, Zq) is a PDM and F C K is the set of final states. The 
u-language accepted by M is 

L(M) — {a G S u I there exists a complete run r of M on a such that In(r)C\F ^ 0} 

Definition 3. A Muller pushdown automaton is a 7 -tuple M — (K, S, r, 6, go, Zq, T) 

where M' = (K, S, r, S, go, Zq) is a PDM and T C 2 K is the collection of designated 
state sets. The u-language accepted by M is 

L(M) = {a G S u I there exists a complete run r of M on a such that In(r) G J-} 

Remark 4. We consider here two acceptance conditions for uj-words, the Bilchi and the 
Muller acceptance conditions, respectively denoted 2-acceptance and 3-acceptance in 
[Lan69J and in [CG78aJ and (in/, and (in/, =) in [Sta97a]. We refer the reader to 
[CG77.CG78a Sta97a EH93] for consideration of weaker acceptance conditions, and 
to [GTW02.PP04J for the definitions of other usual ones like Rabin, Street, or parity 
acceptance conditions. Notice however that it seems that the latter ones have not been 
much considered in the study of context-free cu-languages but they are often involved in 
constructions concerning finite automata reading infinite words. 

Notation. In the sequel we shall often abbreviate "Muller pushdown automaton" by 
MPDA and "Biichi pushdown automaton" by BPDA. 

Cohen and Gold and independently Linna established a characterization Theorem for 
w-CFL. We shall need the notion of "w-Kleene closure" which we now firstly define: 

Definition 5. For any family C offinitary languages, the u-Kleene closure of C is : 

lu-KC(C) = {U^Ui.V? I Vi G [1, n] U h V, G £} 



Theorem 6 (Linna |Lin75|, Cohen and Gold |CG77|). Let CFL be the class of 
context-free (finitary) languages. Then for any to-language L the following three con- 
ditions are equivalent: 

1. Le uj-KC{CFL). 

2. There exists a BPDA that accepts L. 

3. There exists a MPDA that accepts L. 

In I1CG77 I are also studied w-languages generated by w-context-free grammars and it 
is shown that each of the conditions 1), 2), and 3) of the above Theorem is also equiv- 
alent to: 4) L is generated by a context-free grammar G by leftmost derivations. These 
grammars are also studied by Nivat in |Ni v77|Niv78 |. Then we can let the following 
definition: 

Definition 7. An uj-language is a context-free u-language iff it satisfies one of the 
conditions of the above Theorem. The class of context-free uj-languages will be denoted 
byCFL^. 

If we omit the pushdown store in the above Theorem we obtain the characterization of 
languages accepted by classical Muller automata (MA) or Biichi automata (BA) : 

Theorem 8. For any to-language L, the following conditions are equivalent: 

1. L belongs to uj~KC{REG), 

where REG is the class of finitary regular languages. 

2. There exists a MA that accepts L. 

3. There exists a BA that accepts L. 

An co-language L satisfying one of the conditions of the above Theorem is called an 
regular uj-language. The class of regular uj-languages will be denoted by REG^. 

It follows from Mc Naughton's Theorem that the expressive power of deterministic MA 
(DMA) is equal to the expressive power of non deterministic MA, i.e. that every regular 
w-language is accepted by a deterministic Muller automaton, [Nau66PP04|. Notice 
that Choueka gave a simplified proof of Mc Naughton's Theorem in |Cho74|. Another 
variant was given by Rabin in [Rab69 1. Unlike the case of finite automata, deterministic 
MPDA do not define the same class of w-languages as non deterministic MPDA. 
Let us now define deterministic pushdown machines. 

Definition 9. A PDM M = (K, S, r, <5, qo, Zq) is said to be deterministic iff for 
each q <E K, Z E r, and a G S: 

1. 6(q, a, Z) contains at most one element, 

2. A, Z) contains at most one element, and 

3. if 5(q, A, Z) is non empty, then S(q, a, Z) is empty for all a G S. 

It turned out that the class of w-languages accepted by deterministic BPDA is strictly 
included into the class of ^-languages accepted by deterministic MPDA. This lat- 
est class is the class DCFL U of deterministic context-free w-languages. We denote 
DCFL the class of deterministic context-free (finitary) languages. 



Proposition 10 (|CG78a|). 



1. DCFL^ is closed under complementation, but is neither closed under union, nor 
under intersection. 

2. DCFL^ C uj~KC(DCFL) C CFL^ (these inclusions are strict). 

3 Topology 

3.1 Borel hierarchy and analytic sets 

We assume the reader to be familiar with basic notions of topology which may be found 
in MMos80|LT94|Kec95ISta97a|PP041 . There is a natural metric on the set of infinite 
words over a finite alphabet S containing at least two letters which is called the prefix 
metric and defined as follows. For u, v S and u =^ v let S(u, v) = 2 _i p rcf <™ >") where 
l pTe {r u ,v) i s m e first integer n such that the (n + l) st letter of u is different from the 
(n + l) st letter of v. This metric induces on 27" the usual Cantor topology for which 
open subsets of 17" are in the form W.ZJ U ', where W C IT*. A set L C 7£7" is a closed 
set iff its complement 17" — i is an open set. Define now the Borel Hierarchy of subsets 
of 17": 

Definition 11. For a non-null countable ordinal a, the classes X° and n° of the Borel 

Hierarchy on the topological space 7£7" are defined as follows: 

S° is the class of open subsets of S u , 11° is the class of closed subsets of S u , 

and for any countable ordinal a > 2: 

E° is the class of countable unions of subsets of S^ 1 in IJ 7<a H°. 

11° is the class of countable intersections of subsets of S w in IJ 7<Q S°. 

Recall some basic results about these classes : 

Proposition 12. 

(a) £° U 11° C £° +1 ("1 U^ +1 , for each countable ordinal a > 1. 

(b) U 7 < a S° = U 7 < Q II 7 C S° H 11°, for each countable limit ordinal a. 

(c) A set W C X" is in the class S° iff its complement is in the class 11°. 

(d) £° - 11° ^ and 11° - £° ^ hold for every countable ordinal a > 1. 

For a countable ordinal a, a subset of 7i7" is a Borel set of rank a iff it is in £° U 11° 
butnotinU 7<Q (S 7 Un 7 ). 

There are also some subsets of 7i7" which are not Borel. Indeed there exists another hi- 
erarchy beyond the Borel hierarchy, which is called the projective hierarchy and which 
is obtained from the Borel hierarchy by successive applications of operations of pro- 
jection and complementation. The first level of the projective hierarchy is formed by 
the class of analytic sets and the class of co-analytic sets which are complements of 
analytic sets. In particular the class of Borel subsets of 7i7" is strictly included into the 
class X} of analytic sets which are obtained by projection of Borel sets. 



Definition 13. A subset A of is in the class H\ of analytic sets iff there exists 
another finite set Y and a Borel subset B of (S x Y)" such that x G A <-» 3y S Y u 
such that (x,y) € B, where (x, y) is the infinite word over the alphabet £ x Y such 
that (x,y){i) — (x(i),y(i)) for each integer i > 1. 

Remark 14. In the above definition we could take B in the class Tl®. Moreover analytic 
subsets of S u are the projections of Tl^- sub sets of x uf , where uj^ is the Baire 
space, HMos80]l . 

We now define completeness with regard to reduction by continuous functions. For a 
countable ordinal a > 1, a set F C S u is said to be a S° (respectively, 11° , £})- 
complete set iff for any set E C Y u (with Y a finite alphabet): E 6 S° (respectively, 
E e 11°, 7? e £}) iff there exists a continuous function / : Y^ — > such that £ = 
f^ 1 (F). S° (respectively Il")-complete sets, with n an integer > 1, are thoroughly 
characterized in HSta86l . 

In particular 1Z = (0*.l) w is a well known example of il^-complete subset of {0, 1} U . 
It is the set of w-words over {0, 1} having infinitely many occurrences of the letter 1. 
Its complement {0, 1}" - (0*.l) w is a ^-complete subset of {0, 1} U . 

We recall now the definition of the arithmetical hierarchy of w-languages which form 
the effective analogue to the hierarchy of Borel sets of finite ranks. 
Let X be a finite alphabet. An cj-language L C I u belongs to the class S n if and 
only if there exists a recursive relation Rl C (N)™ -1 x X* such that 

L = {a e X u I 3ai . . . Q n a„ (a x ,..., a n -i,cr[a» + 1]) £ Rl} 

where Qi is one of the quantifiers V or 3 (not necessarily in an alternating order). An 
cj-language L C X^ belongs to the class 7T„ if and only if its complement X^ — L 
belongs to the class S n . The inclusion relations that hold between the classes S n and 
n n are the same as for the corresponding classes of the Borel hierarchy. The classes 
E n and 7T n are included in the respective classes S° and S° of the Borel hierarchy, 
and cardinality arguments suffice to show that these inclusions are strict. 

As in the case of the Borel hierarchy, projections of arithmetical sets (of the second 
77-class) lead beyond the arithmetical hierarchy, to the analytical hierarchy of uj- 
languages. The first class of this hierarchy is the (lightface) class S\ of effective analytic 
sets which are obtained by projection of arithmetical sets. An cj-language L C X u 
belongs to the class S\ if and only if there exists a recursive relation Rl C (N) x 
{0,1}* x X* such that: 

L = {a EX U \ 3t(t e {0,1}" AWn3m((n,T[m],a[m}) £ R L ))} 

Then an w-language L C X^ is in the class S\ iff it is the projection of an uj- 
language over the alphabet X x {0, 1} which is in the class 772. The (lightface) class 
n I of effective co-analytic sets is simply the class of complements of effective analytic 
sets. We denote as usual A\ — S\ n 77 {. 



Recall that that an ^-language L C is in the class S\ iff it is accepted by a non 
deterministic Turing machine (reading cj-words) with a Biichi or Muller acceptance 
condition MSta97a| . 

The Borel ranks of (lightface) A\ sets are the (recursive) ordinals 7 < K , where K 
is the first non-recursive ordinal, usually called the Church-Kleene ordinal. Moreover, 
for every non null ordinal a < wf K , there exist some S"-complete and some Un- 
complete sets in the class A\. 

On the other hand, Kechris, Marker and Sami proved in | KMS89 1 that the supremum of 
the set of Borel ranks of (lightface) Z^-sets is the ordinal 73. This ordinal is proved to 
be strictly greater than the ordinal 5\ which is the first non A\ ordinal. In particular, the 
ordinal 72 is strictly greater than the ordinal K . Remark that the exact value of the 
ordinal 73 may depend on axioms of set theory, see [KMS89 Fin06a| for more details. 
Notice also that it seems still unknown whether every non null ordinal 7 < 73 is the 
Borel rank of a (lightface) JC* -set, 



3.2 Wadge hierarchy 

We now introduce the Wadge hierarchy, which is a great refinement of the Borel hier- 
archy defined via reductions by continuous functions, | Dup01| Wad83|. 



Definition 15 (Wadge | Wad83|). Let X, Y be two finite alphabets. For L C X^ and 
L' C Y u , L is said to be Wadge reducible to L' (L <w L') iff there exists a continuous 
function f : X u — > Y u , such that L = / _1 (Z/). 

L and L' are Wadge equivalent iff L <w L' and L' <\y L. This will be denoted by 
L =w L' . And we shall say that L <w L' iff L <w L but not L <w L. 
A set L C X u is said to be self dual iff L =w L~, and otherwise it is said to be non 
self dual. 

The relation <yv is reflexive and transitive, and =w is an equivalence relation. 
The equivalence classes of =w are called Wadge degrees. 

The Wadge hierarchy WH is the class of Borel subsets of a set X u , where X is a finite 
set, equipped with <w and with =w- 

For L C X^ and V C Y u , if L < w L' and L = f~ l {L') where / is a continuous 
function from X u into Y u , then / is called a continuous reduction of L to L 1 . Intuitively 
it means that L is less complicated than V because to check whether x G L it suffices 
to check whether f(x) € L' where / is a continuous function. Hence the Wadge degree 
of an cj-language is a measure of its topological complexity. 

Notice that in the above definition, we consider that a subset L C I u is given together 
with the alphabet X. This is important as it is shown by the following simple example. 
Let L\ = {0, 1}" C {0, 1}" and L 2 = {0, 1}" C {0, 1, 2} w . So the languages L x 
and L2 are equal but considered over the different alphabets X\ = {0, 1} and X2 = 
{0, 1, 2}. It turns out that L\ <w L 2 . In fact L\ is open and closed in X" while L 2 is 
closed but non open in X%. 



We can now define the Wadge class of a set L: 



Definition 16. Let L be a subset of X u . The Wadge class of L is : 

[L] = {L | L C Y u for a finite alphabet Y and L' <\y L}. 
Recall that each Borel class X° and 11° is a Wadge class. 

A set L C I" is a £° (respectively 11° )-complete set iff for any set L' C 7", L' is in 
E° (respectively 11°) iff L' <w L . It follows from the study of the Wadge hierarchy 
that a set L C X u is a X° (respectively, Yl° a )-complete set iff it is in £° but not in 11° 
(respectively, in 11° but not in S° ). 

There is a close relationship between Wadge reducibility and games which we now 
introduce. 

Definition 17. Let L C X u and L' C The Wadge game W(L, L') is a game with 
perfect information between two players, player 1 who is in charge of L and player 2 
who is in charge of L'. 

Player 1 first writes a letter a\ G X, then player 2 writes a letter b\ G Y , then player 
1 writes a letter a 2 £ X, and so on. 

The two players alternatively write letters a n of X for player 1 and b n of Y for player 
2. 

After lj steps, the player 1 has written an u-word a £ X u and the player 2 has written 
an LO-word b £ Y u . The player 2 is allowed to skip, even infinitely often, provided he 
really writes an uj-word in lj steps. 

The player 2 wins the play iff [a £ L «-> b G L' ], i.e. iff: 

[(a G L and b G L') or (a ^ L and b ^ L' and b is infinite)/- 

Recall that a strategy for player 1 is a function a : (Y U {s})* — > X. And a strategy for 
player 2 is a function / : X + -^l^U {s}. 

a is a winning stategy for player 1 iff he always wins a play when he uses the strategy 
cr, i.e. when the n th letter he writes is given by a n = a(b\ . . . b n -i), where bi is the 
letter written by player 2 at step i and bi = s if player 2 skips at step i. 
A winning strategy for player 2 is defined in a similar manner. 

Martin's Theorem states that every Gale-Stewart Game G(X), with X a. borel set, is 
determined, see |Kec95|. This implies the following determinacy result : 

Theorem 18 (Wadge). Let L C X" and L' C Y u be two Borel sets, where X and 
Y are finite alphabets. Then the Wadge game L') is determined : one of the two 

players has a winning strategy. And L <yy L' iff the player 2 has a winning strategy in 
the game W{L, L'). 

Theorem 19 (Wadge). Up to the complement and = w, the class of Borel subsets of 

I", for a finite alphabet X, is a well ordered hierarchy. There is an ordinal \ WH\, 

called the length of the hierarchy, and a map d^- from WH onto \ WH\ — {0}, such 

that for all L, L' C X u : 

dyyL < dyyL' <-> L <\y L' and 

dP w L = dP w L' «-> [L = w V or L = w L'~]. 



The Wadge hierarchy of Borel sets of finite rank has length 1 £o where 1 eo is the limit 
of the ordinals a„ defined by oil = lo\ and a n+ \ = tu" n for n a non negative integer, 
ui\ being the first non countable ordinal. Then 1 £o is the first fixed point of the ordinal 
exponentiation of base w\. The length of the Wadge hierarchy of Borel sets in A° = 
X° n 11° is the uj[ h fixed point of the ordinal exponentiation of base ui, which is a 
much larger ordinal. The length of the whole Wadge hierarchy of Borel sets is a huge 
ordinal, with regard to the Lu[ h fixed point of the ordinal exponentiation of base u}\. It is 
described in [Wad83 ]Dup01) by the use of the Veblen functions. 



4 Topological complexity of context-free u; -languages 

We recall first results about the topological complexity of regular w-languages. Topo- 
logical properties of regular w-languages were first studied by L. H. Landweber in 
BLan691 where he characterized regular w-languages in a given Borel class. It turned 
out that a regular w-language is a Il^-set iff it is accepted by a deterministic Biichi au- 
tomaton. On the other hand Mc Naughton's Theorem implies that regular ^-languages, 
accepted by deterministic Muller automata, are boolean combinations of regular u>- 
languages accepted by deterministic Biichi automata. Thus they are boolean combina- 
tions of Il^-sets hence A§-sets. Moreover Landweber proved that one can effectively 
determine the exact level of a given regular w-language with regard to the Borel hierar- 
chy. 

A great improvement of these results was obtained by Wagner who determined in an 
effective way, using the notions of chains and superchains, the Wadge hierarchy of the 
class REG^j, [Wag79]. This hierarchy has length u u and is now called the Wagner 
hierarchy, ISel95ISel03alSel03blSel98ISta97al . Wilke and Yoo proved in BWY95I that 
one can compute in polynomial time the Wadge degree of a regular w-language. Later 
Carton and Perrin gave a presentation of the Wagner hierarchy using algebraic notions 
of w-semigroups, |CP99 CP 97IPP041 . This work was completed by Duparc and Riss in 
IIDR051 . 

Context-free ^-languages beyond the class A§ have been constructed for the first time 
in BFinOlcl . The construction used an operation of exponentiation of sets of finite or in- 



finite words introduced by Duparc in his study of the Wadge hierarchy [DupOl |. We are 
going now to recall these constructions although some stronger results on the topolog- 
ical complexity of context-free w-languages were obtained later in [Fin05a Fin06a] by 
other methods. However the methods of [Fin01c| using Duparc's operation of exponen- 
tiation are also interesting and they gave other results on ambiguity and on w-powers 
of context-free languages we can not (yet ?) get by other methods, see Sections|6]and|7] 
below. 



Wadge gave a description of the Wadge hierarchy of Borel sets in [Wad83|. Duparc 
recently got a new proof of Wadge's results and gave in [Dup 95|E)up01 1 a normal form 
of Borel sets in the class A° , i.e. an inductive construction of a Borel set of every given 
degree smaller than the uif 1 fixed point of the ordinal exponentiation of base u\. The 
construction relies on set theoretic operations which are the counterpart of arithmetical 



operations over ordinals needed to compute the Wadge degrees. 

In fact Duparc studied the Wadge hierarchy via the study of the conciliating hierarchy. 
Conciliating sets are sets of finite or infinite words over an alphabet X, i.e. subsets of 
X* U X u = X- u . It turned out that the conciliating hierarchy is isomorphic to the 
Wadge hierarchy of non-self-dual Borel sets, via the correspondence A — > A d we now 
recall : 

For A C X— and d a letter not in X, A d is the w-language over X U {d} which is 
defined by : 

A d = {x e (X u | x(/d) e A} 

where x(/d) is the sequence obtained from x when removing every occurrence of the 
letter d. 

We are going now to introduce the operation of exponentiation of conciliating sets. 

Definition 20 (Duparc [DupOl |). Let U be a finite alphabet and £j t and let x be 

a finite or infinite word over the alphabet X = S U {<^}. 
Then x*"~ is inductively defined by: 

A^ = A, 

and for a finite word u G (S U {«—})*•" 
(it.a)* - = u*~.a, if a G S, 

[u. ^-)^ = u*~ with its last letter removed if |it*~ | > 0, 
i.e. (u. <^)^ = u*-(l).u*-(2) ...tt*-(|u*-| - 1) if\u*~\ > 0, 
(u. «-)*" =\tf\u*-\ = 0, 
and for u infinite: 

{u)*~ = lim„ e[iJ where, given (3 n and v in £*, 

v C lim nEa) /3 n <-» ElnVp > n /3 p [\v\] = v. 

(The finite or infinite word lim„ ew f3 n is determined by the set of its (finite) prefixes). 

Remark 21. For x G X- u , x*~ denotes the string x, once every «- occuring in x has 
been "evaluated" to the backspace operation, proceeding from left to right inside x. In 
other words x*~ = xfrom which every interval of the form "a «— " (a G S) is removed. 

For example if u = (a <^) r \ for n an integer > 1, or u = (a or u = (a <^<^) w , 
then (it)*" = A. If u = (ab <^) w then (u)*" = a" and if u = bb(«- a)" then (u)*~ = b. 

Let us notice that in Definition [20] the limit is not defined in the usual way: 
for example if u = bb(<*- a)" the finite word u[n]*~ is alternatively equal to b or to 
ba: more precisely u[2n + 1]*" = b and u[2n + 2]^ = ba for every integer n > 
1 (it holds also that = b and u[2]*~ = bb). Thus Definition l20l implies that 

lim ngw (u[n])^ = b so u*~ = b. 

We can now define the operation A — > A~ of exponentiation of conciliating sets: 
Definition 22 (Duparc | DupOl]). For A C and £, let 

A~= df {xe(ZU{^})^\x^€A}. 



The operation ~ is monotone with regard to the Wadge ordering and produces some 
sets of higher complexity. 



Theorem 23 (Duparc pup()l| ). Let A C and n > 1. if A d C (17 u 

w a S°- complete (respectively, Tl^-complete) set, then (A~) d is a S° , ^complete 
(respectively, Tl® l+1 -complete) set. 

It was proved in [Fin01c| that the class of context-free infinitary languages (which are 
unions of a context-free finitary language and of a context-free w-language) is closed 
under the operation A — > A~. On the other hand A — - > A d is an operation from the 
class of context-free infinitary languages into the class of context-free w-languages. 
This implies that, for each integer n > 1, there exist some context-free ^-languages 
which are S°-complete and some others which are IT"-complete. 

Theorem 24 (|Fin01c|). For each non negative integer n > 1, there exist TP n -complete 
context-free Lo-languages A n and Tl^-complete context-free to-languages B n . 

Proof. For n = 1 consider the S"-complete regulars-language 

4i = {ae {0, 1} U | 3i a(i) = 1} 
and the n"-complete regular w-language 

Bx = {a G {0, 1}" | Mi a(i)=0}. 

These languages are context-free w-languages because REG^ C CFL^. 
Now consider the Xj-complete regular cj-language 
A2 = {ae{0,l}' J |3 < ^ a(i) = l} 
and the Ilj -complete regular cj-language 

B 2 = {ae {0, 1} W | 3 w z a(i)=0}, 

where 3 <w z means: " there exist only finitely many i such that . . ." , and 

means: " there exist infinitely many i such that . . .". 
A2 and B2 are context-free cj-languages because they are regular ^-languages. 

To obtain context-free w-languages of greater Borel ranks, consider now 0\ (respec- 
tively, Ci ) subsets of {0, 1}- W such that (Ox) d (respectively, {Cx) d ) are Sj-complete 
( respectively ITj'-complete ) . 

For example Ox = {x G {0, 1}^ | 3 i x(i) = 1} and d = {A}. 

We can apply n > 1 times the operation of exponentiation of sets. 
More precisely, we define, for a set A C X- w : 

= A 
A~ A = A~ and 
^~.(n+l) _ (^4~-™)~ _ 

Now apply n times (for an integer n > 1) the operation ~ (with different new letters 
«— 1, «-2, «— 3, . . . , «— n) to Oi and C\. 

By Theorem[23] it holds that for an integer n > 1: 

(0^ ") d is a £° +1 -complete subset of {0, 1, . . . , «- n , 

(C£--") d is a n° +1 -complete subset of {0, 1, . . . , 



And it is easy to see that 0\ and C\ are in the form EUF where E is a finitary context- 
free language and F is a context-free ^-language. Then the ^-languages (0^ ") d 
and (Ci' n ) d are context-free. Hence the class CFL^ exhausts the finite ranks of the 
Borel hierarchy: we obtain the context-free cj-languages A„ = (0~- ( ™ _1) ) d and B n = 
(Cr- ( " _1) ) d ,forn > 3. 

□ 

This gave a partial answer to questions of Thomas and Lescow [LT94| about the hier- 
archy of context-free w-languages. 

A natural question now arose: Do the decidability results of [Lan69| extend to context- 
free w-languages? Unfortunately the answer is no. Cohen and Gold proved that one 
cannot decide whether a given context-free w-language is in the class n°, or Iljj, 
|CG77 |. This result was first extended to all classes S° and 11°, for n an integer > 1, 
using the undecidability of the Post Correspondence Problem, BFinOlcl . 

Later, the coding of an infinite number of erasers <^ n , n > 1, and an iteration of the 
operation of exponentiation were used to prove that there exist some context-free u>- 
languages which are Borel of infinite rank, [Fin03c|. 

Using the correspondences between the operation of exponentiation of sets and the 
ordinal exponentiation of base u>\, and between the Wadge's operation of sum of sets, 
| Wad83 DupOl |, and the ordinal sum, it was proved in [FinOld] that the length of the 
Wadge hierarchy of the class CFL U is at least eo> the first fixed point of the ordinal 
exponentiation of base oj. Next were constructed some A° context-free w-languages 
in e w Wadge degrees, where is the ui th fixed point of the ordinal exponentiation of 
basea>, and also some S° -complete context-free w-languages, [FinOlb Fin05b]. Notice 
that the Wadge hierarchy of non-deterministic context-free ^-languages is not effective, 
IFinOldl . 

The question of the existence of non-Borel context-free w-languages was solved by 
Finkel and Ressayre. Using a coding of infinite binary trees labeled in a finite alphabet 
S, it was proved that there exist some non-Borel, and even £ \ -complete, context-free 
w-languages, and that one cannot decide whether a given context-free w-language is 
a Borel set, [Fin03b|. Amazingly there is a simple finitary language V accepted by a 
1-counter automaton such that V u is £}-complete; we shall recall it in Section|7]below 
on w-powers. 

But a complete and very surprising result was obtained in |Fin05a Fin06a|, which ex- 
tended previous results. A simulation of multicounter automata by 1-counter automata 
was used in [Fin05a Fin06a|. We firstly recall now the definition of these automata, in 
order to sketch the constructions involved in these simulations. 

Definition 25. Let k be an integer > 1. A k-counter machine (k-CM) is a 4-tuple 
A4 = (K, S, A, qo), where K is a finite set of states, E is a finite input alphabet, qo G K 
is the initial state, and A C K x (£L){\}) x {0, l} k x K x {0, 1, — l} fe is the transition 



relation. The k-counter machine M. is said to be real time iff: A C K x S X {0, l} k X 
K X {0, 1, — l} k , i.e. iff there are not any X-transitions. 

If the machine A4 is in state q and a G N is the content of the i th counter Ci then the 
configuration (or global state) of M. is the (k + \)-tuple (q, c\ 1 . . . , c&). 

For a G £ U {A}, q,q' G K and (c\, . . . , C&) G N fc such that Cj — for jG.EC 
{l,...,fc} andc, > Oforj £ E, if(q, a, i u . . . , i k , q', j u . . . , j k ) G A where ij = 
for j G E and L — lfor j (f. E, then we write: 

a : {q,c x , ...,c k ) ^ M (q',Ci + j 1 , . . -,c k +j k ) 

Thus we see that the transition relation must satisfy: 

if (q,a,ii,..., i k , q', ji,..., jk) G A and i m = for some m G {!,..., k}, then 
jm = or j rn = 1 ( but j m may not be equal to — 1 ). 

Let a = a\a,2 . . . a n . . . be an co-word over S. An co-sequence of configurations r = 
[qi, c\, . . . c % k )i>\ is called a run of M. on a, starting in configuration (p, c\, . . . , c k ), 
iff: 

(1) (qx,c\,...c\) = (p,a,...,Ck) 

(2) for each i > 1, there exists bi G S U {A} such that bi : (qi,c\, 
(qi+i, c^ +1 , . . . c)j, +1 ) such that either a±a2 . . . a n . . . — 6162 • • ■ b n . 
or bib 2 . . . b n . . . is a finite prefix of aia 2 . . . a„ . . . 

The run r is said to be complete when a\a 2 ■ ■ ■ a n ■ ■ ■ = b\b 2 ■ ■ - b n . . . 
For every such run, In(r) is the set of all states entered infinitely often during run r. 
A complete run r of M on a, starting in configuration (go>0, . . . ,0), will be simply 
called "a run of M on a". 

Definition 26. A Bilchi k-counter automaton is a 5-tuple Ai = (K, S, A, q , F), where 
Ai'=(K, S, A, qo) is a k-counter machine and F C K is the set of accepting states. 
The co-language accepted by M. is 

L(M)= {a G S u I there exists a run r of Ad on a such that In(r) PI F ^ 0} 

The notion of Muller fc-counter automaton is defined in a similar way. One can see that 
an w-language is accepted by a (real time) Biichi fc-counter automaton iff it is accepted 
by a (real time) Muller fc-counter automaton [EH93 1. Notice that this result is no longer 
true in the deterministic case. 

We denote BC(fc) (respectively, r-BC(fc)) the class of Biichi fc-counter automata (re- 
spectively, of real time Biichi fc-counter automata. 

We denote BCL(fc) w (respectively, r-BCL(fc) w ) the class of cj-languages accepted 
by Biichi fc-counter automata (respectively, by real time Biichi fc-counter automata). 

Remark that 1 -counter automata introduced above are equivalent to pushdown automata 
whose stack alphabet is in the form {Zo,A} where Zq is the bottom symbol which al- 
ways remains at the bottom of the stack and appears only there and A is another stack 
symbol. The pushdown stack may be seen like a counter whose content is the integer 




N if the stack content is the word A N .Zq. 

In the model introduced here the counter value cannot be increased by more than 1 dur- 
ing a single transition. However this does not change the class of a>languages accepted 
by such automata. So the class BCL(l) a) is equal to the class 1-ICL W , introduced in 
[ FinO 1 d | , and it is a strict subclass of the class CFL W of context-free w-languages 
accepted by Biichi pushdown automata. 

We state now the surprising result proved in [Fin06a|, using multicounter-automata. 

Theorem 27 ([Fin06a |). The Wadge hierarchy of the class r-BCL(l) w , hence also of 
the class CFL W , or of every class C such that r-BCL(l) w C CC E\, is the Wadge 
hierarchy of the class E\ of co-languages accepted by Turing machines with a Biichi 
acceptance condition. 

We now sketch the proof of this result. It is well known that every Turing machine can 
be simulated by a (non real time) 2-counter automaton, see HHU791 . Thus the Wadge 
hierarchy of the class BCL(2) W is also the Wadge hierarchy of the class of w-languages 
accepted by Biichi Turing machines. 

One can then find, from an cj-language L C E u in BCL(2) W , another w-language 
9s{L) which will be of the same topological complexity but accepted by a real-time 
8-counter Biichi automaton. The idea is to add firstly a storage type called a queue to a 
2-counter Biichi automaton in order to read u-words in real-time. Then the queue can 
be simulated by two pushdown stacks or by four counters. This simulation is not done 
in real-time but a crucial fact is that one can bound the number of transitions needed to 
simulate the queue. This allows to pad the strings in L with enough extra letters so that 
the new words will be read in real-time by a 8-counter Biichi automaton. The padding 
is obtained via the function 9 s which we define now. 

Let E be an alphabet having at least two letters, E be a new letter not in E, S be an 
integer > 1, and 9 S : S u -> (S U {E}) u be the function defined, for all x € E u , by: 

9 s (x) = x(l).E s .x(2).E s2 .x(3).E s3 .x(4) . . . x(n).E sn .x{n + 1).E S " +1 . . . 

It turns out that if L C E u is in BCL(2) W then there exists an integer S > 1 such that 
9s{L) is in the class r-BCL(8) w , and, except for some special few cases, 9s(L) =w L. 

The next step is to simulate a real-time 8-counter Biichi automaton, using only a real- 
time 1 -counter Biichi automaton. 

Consider the product of the eight first prime numbers: 

K = 2.3.5.7.11.13.17.19 = 9699690 
Then an cj-word x G E u can be coded by the w-word 

h(x) = A.0 K .x(1).B.0 k \a.0 k2 .x(2).B.0 k3 .A.0 k3 .x(3).B . . . B.0 K " .A.0 K " .x(n).B . . . 



where A, B and are new letters not in S. The mapping h : E u — > (S U {A, B, 0})" 
is continuous. It is easy to see that the cj-language h(ZJ u )~ is an open subset of (S U 
{A, B, 0})" and that it is in the class r-BCL(l) w . 

If L(A) C S u is accepted by a real time 8-counter Biichi automaton A, then one can 
construct effectively from A a 1-counter Biichi automaton B, reading words over the 
alphabet £ U {A, B, 0}, such that L{A)= h-\L{B)), i.e. 

\fx e h(x) G L(B) < — ► x G L(.4) 

In fact, the simulation, during the reading of h(x) by the 1-counter Biichi automa- 
ton £>, of the behaviour of the real time 8-counter Biichi automaton A reading x, can 
be achieved, using the coding of the content (ci, c%, . . . , c$) of eight counters by the 

product 2 Cl .3 C2 (17) C7 .(19) Cs , and the special shape of w-words in h(E u ) which 

allows the propagation of the value of the counters of A. A crucial fact here is that 
h(E u1 )'' is in the class r-BCL(l) a) . Thus the w-language 

h(L(A)) U hiE 1 ")- = L(B) U h[S w )- 

is in the class BCL(1) W and it has the same topological complexity as the w-language 
L(A), (except the special few cases where dw{L(A)) < u>). 

One can see, from the construction of B, that at most (K — 1) consecutive A-transitions 
can occur during the reading of an w-word x by B. It is then easy to see that the u>- 
language <f>(h(L(A))*Jh(E u )~) is an w-language in the class r-BCL(l) u which has 
the same topological complexity as the w-language L(A), where <f> is the mapping from 
{£ U {A, B, 0})" into (S U {A, B, F, 0}) u , with F a new letter, which is defined by: 

4>{ x ) = F K - 1 .x{l).F K - 1 .x{2).F K - 1 .x(3) . . . F K ~ 1 .x(n).F K ~ 1 .x(n+l).F K ~ 1 . . . 

Altogether these constructions are used in [Fin06a| to prove Theorem[27] As the Wadge 
hierarchy is a refinement of the Borel hierarchy and, for any countable ordinal a, S° - 
complete sets (respectively, n° -complete sets) form a single Wadge degree, this implies 
also the following result. 

Theorem 28. Let C be a class of lo -languages such that: 

r-BCL(l) u CCC E\. 

(a) The Borel hierarchy of the class C is equal to the Borel hierarchy of the class E\. 

(b) 72 = Sup {a | 3L G C such that L is a Borel set of rank a}. 

(c) For every non null ordinal a < a->f K , there exists some YP a -complete and some 
YY a a -complete uj-languages in the class C. 

Notice that similar methods have next be used to get another surprising result: the 
Wadge hierarchy, hence also the Borel hierarchy, of infinitary rational relations accepted 
by 2-tape Biichi automata is equal to the Wadge hierarchy of the class r-BCL(l) w or 
of the class E{, IIFin06bl . 



5 Topological complexity of deterministic context-free 
^-languages 



We have seen in the preceeding section that all non-deterministic finite machines ac- 
cept cj-languages of the same topological complexity, as soon as they can simulate a 
real time 1-counter automaton. 

This result is still true in the deterministic case if we consider only the Borel hier- 
archy. Recall that regular w-languages accepted by Biichi automata are IT^-sets and 
w-languages accepted by Muller automata are boolean combinations of IlfJ-sets hence 
A§-sets. Engelfriet and Hoogeboom proved that this result holds also for all w-languages 
accepted by deterministic X-automata, i.e. automata equipped with a storage type X, 
including the cases of fc-counter automata, pushdown automata, Petri nets, Turing ma- 
chines. In particular, cj-languages acccepted by deterministic Biichi Turing machines 
are n^-sets and w-languages acccepted by deterministic Muller Turing machines are 
Assets. 



It turned out that this is no longer true if we consider the much finer Wadge hierarchy to 
measure the complexity of w-languages. The Wadge hierarchy is suitable to distinguish 
the accepting power of deterministic finite machines reading infinite words. Recall that 
the Wadge hierarchy of regular w-languages, now called the Wagner hierarchy, has been 
effectively determined by Wagner; it has length |Wag79|Sel95ISel98l . 



Its extension to deterministic context-free w-languages has been determined by Du- 
parc, its length is uj^ ) [DFR01 Dup03 |. To determine the Wadge hierarchy of the class 
DCFL U , Duparc first defined operations on DMPDA which correspond to ordinal op- 
erations of sum, multiplication by u, and multiplication by u>\, over Wadge degrees. 
This way are constructed some DMPDA accepting a>languages of every Wadge degree 
in the form : 



d° w (A) = u'l 3 .5j + lj^- 1 .5j- x + ...+CJ? 1 A 

wherej > is an integer, rij > %-i > ■■• > n\ are integers > 0, and Sj, <5j-i, 8% 
are non null ordinals < to". 



On the other hand it is known that the Wadge degree a of a boolean combination of 
Ilj-sets is smaller than the ordinal df thus it has a Cantor normal form : 

a = lo'] 3 .Sj + ,5j-i + . . . + ui" 1 .Si 

where j > is an integer, rij > rij-i > . . . > n% are integers > 0, and Sj, 8j-%, . . . , 6% 
are non null ordinals < uj\, i.e. non null countable ordinals. In a second step it is proved 
in |Dup03) , using infinite multi -player games, that if such an ordinal a is the Wadge 

degree of a deterministic context-free ^-language, then all the ordinals Sj, Sj-% #i 

appearing in its Cantor normal form are smaller than the ordinal < lo u . Thus the Wadge 
hierarchy of the class DCFL U is completely determined. 



Theorem 29 (Duparc [Dup03 |). The Wadge degrees of deterministic context-free ui- 
languages are exactly the ordinals in the form : 

a = w 1 .dj + -Oj-i + . . . + w l .di 

where j > is an integer, rij > n^-i > ... > n\ are integers > 0, andSj, Sj—i, . . . , Si 
are non null ordinals < uj w . 

The length of the Wadge hierarchy of the class DCFL U is the ordinal (lu u ) u = u>^ u \ 

Notice that theWadge hierarchy of DCFL^ is not determined in an effective way in 
[Dup03 1. The question of the decidability of problems like: "given two DMPDA A and 
B, does L(A) <w L(B) hold ?" or "given a DMPDA A can we compute d%,(L(A))T 
naturally arises. 

Cohen and Gold proved that one can decide whether an effectively given w-language 
in DCFL U in an open or a closed set [CG77]. Linna characterized the ^-languages 
accepted by DBPDA as the Tl^-sets in DCFL^ and proved in IILin77H that one can 
decide whether an effectively given w-language accepted by a DMPDA is a TI§-set or 
a S°-set. 

Using a recent result of Walukiewicz on infinite games played on pushdown graphs, 
|Wal00|, these decidability results were extended in [FinOlc] where it was proved that 
one can decide whether a deterministic context-free w-language accepted by a given 
DMPDA is in a given Borel class II J, or 11° or even in the wadge class [L] 
given by any regular w-language L. 

An effective extension of the Wagner hierarchy to ^-languages accepted by Muller 
deterministic real time blind (i. e. without zero-test) 1-counter automata has been deter- 
mined in |Fin01a|. Recall that blind 1-counter automata form a subclass of 1-counter 
automata hence also of pushdown automata. A blind 1-counter Muller automaton is 
just a Muller pushdown automaton M = (K, S, r, 6, qo, Z , T) such that F = {Z , 1} 
where Z is the bottom symbol and always remains at the bottom of the store. More- 
over every transition which is enabled at zero level is also enabled at non zero level, i.e. 
if S(q, a, Zq) — (p, F l Z ), for some p,q E K, a £ S and n > 0, then S(q, a, I) = 
(p, I n+1 ). But the converse may not be true, i.e. some transition may be enabled at non 
zero level but not at zero level. Notice that blind 1-counter automata are sometimes 
called partially blind 1-counter automata as in IIGre78l . 

The Wadge hierarchy of blind counter u>-languages, accepted by deterministic Muller 
real time blind 1-counter automata (MBCA), is studied in [ FinO 1 a | in a similar way as 
Wagner studied the Wadge hierarchy of regular cj-languages in [Wag79|. Chains and 
superchains for MBCA are defined as Wagner did for Muller automata. The essential 
difference between the two hierarchies relies on the existence of superchains of trans- 
finite length a < uj 2 for MBCA when in the case of Muller automata the superchains 
have only finite lengths. The hierarchy of w-languages accepted by MBCA is effective 
and leads to effective winning strategies in Wadge games between two players in charge 
of w-languages accepted by MBCA. Concerning the length of the Wadge hierarchy of 
MBCA the following result is proved : 



Theorem 30 (Finkel |Finj)Ta)). 



(a) The length of the Wadge hierarchy of blind counter uj-languages in is ui 2 . 

(b) The length of the Wadge hierarchy of blind counter ui-languages is the ordinal lj^ 
(hence it is equal to the length of the Wagner hierarchy). 

Notice that the length of the Wadge hierarchy of blind counter w-languages is equal to 
the length of the Wagner hierarchy although it is actually a strict extension of the Wag- 
ner hierarchy, as shown already in item (a) of the above theorem. The Wadge degrees 
of blind counter w-languages are the ordinals in the form : 

a = uj" 3 .Sj + ui" 3 ^ 1 ,5j-i + . . . + uj™ 1 .Si 

where j > is an integer, rij > rij-x > . . . > n\ are integers > 0, and Sj , Sj-i , . . . , S\ 
are non null ordinals < uj 2 . Recall that in the case of Muller automata, the ordinals 
Sj, Sj-x> • - • , <5i are non-negative integers, i.e. non null ordinals < uj. 

Notice that Selivanov has recently determined the Wadge hierarchy of cj-languages 
accepted by deterministic Turing machines; its length is (u>f K )" [Sel03b Sel03a|. The 
w-languages accepted by deterministic Muller Turing machines or equivalently which 
are boolean combinations of arithmetical ili? -sets have Wadge degrees in the form : 

a = u> 1 .dj + ui-y J -dj-i + . . . + oj 1 .oi 

where j > is an integer, rij > Tij-i > ■ • • > n± are integers > 0, and 8j , Sj- % , . . . , Si 
are non null ordinals < cjf K . 

6 Topology and ambiguity in context-free cj-languages 

The notions of ambiguity and of degrees of ambiguity are well known and important 
in the study of context-free languages. These notions have been extended to context- 
free ^-languages accepted by Buchi or Muller pushdown automata in [Fin03a|. Notice 
that it is proved in |Fin03a] that these notions are independent of the Buchi or Muller 
acceptance condition. So in the sequel we shall only consider the Biichi acceptance 
condition. 

We now firstly introduce a slight modification in the definition of a run of a Biichi 
pushdown automaton, which will be used in this section. 

Definition 31. Let A — (K, S, r, 5, qo, Zq, F) be a Biichi pushdown automaton. 
Let a — a\a,2 . . . a n . . . be an ui-word over S. A run of A on a is an infinite sequence 
r = (ft) 7ij £ i)i>i where (qi, 7j)j>i is an infinite sequence of configurations of A and, 
for all i > 1, £j 6 {0, 1} and: 

L (qi, 7i ) = (<7o,Z ) 

2. for each i > 1, there exists bi G S U {A} satisfying 

h ■ fe,7i) (ft+i,7i+i) 
and( £i — iffbi = A ) 

and such that a\a^ . . . a n . . . = 6162 . . .b n . . . 



As before the uj-language accepted by A is 

L(A) = {u G £ u | there exists a run r of A on a such that In(r) (~l F ^ 0} 

Notice that the numbers Ei € {0, 1} are introduced in the above definition in order to 
distinguish runs of a BPDA which go through the same infinite sequence of configura- 
tions but for which A-transitions do not occur at the same steps of the computations. 

As usual the cardinal of ui is denoted Ho and the cardinal of the continuum is denoted 
2 K °. The latter is also the cardinal of the set of real numbers or of the set £ u for every 
finite alphabet £ having at least two letters. 

We are now ready to define degrees of ambiguity for BPDA and for context-free uo- 
languages. 

Definition 32. Let Abe a BPDA reading infinite words over the alphabet £. For x G 
£ u let a A (x) be the cardinal of the set of accepting runs of A on x. 

Lemma 33 ([Fin03a|). Let Abe a BPDA reading infinite words over the alphabet £. 
Then for all x G £" it holds that a A (x) G NU {H ,2 N °}. 

Definition 34. Let Abe a BPDA reading infinite words over the alphabet £. 

(a) Ifswp{a A (x) | x G £ u } G N U {2 N °}, then a A = sup{a A (x) | x G £"}. 

(b) If sixp{a A (x) | x G £^} — Ho an d there is no word x G £ u such that ct A {x) = 
Ho, then a A = Nq . 

(Hg~ does not represent a cardinal but is a new symbol that is introduced here to 
conveniently speak of this situation). 

(c) If sup{a A (x) | x G £^} = Ho and there exists (at least) one word x G £ w such 
that ct A {x) — Ho, then a A = Ho 

Notice that for a BPDA A, a A = iff A does not accept any w-word. 

We shall consider below that N U {Hq~, H , 2 n °} is linearly ordered by the relation <, 

which is defined by : Vfc G N, k < k + 1 < H < H < 2 K °. 

Definition 35. For k G N U {H^, H , 2 N °} let 
CFL u (a <k) = {L(A) \Aisa BPDA with a A < k} 
CFL u (a <k) = {L(A) \Aisa BPDA with a A < k} 

NA—CFL U — CFL u (a < 1) is the class of non ambiguous context-free lo -languages. 
For every integer k such that k > 2, or k G {H^ , Ho, 2 No }, 
A(k) ~ CFL^ = CFLuj(a < k) - CFL u (a < k) 

If L G A(k) - CFL U with k G N, k > 2, or k G {H , H , 2 H °}, then L is said to be 
inherently ambiguous of degree k. 

Notice that one can define in a similar way the degree of ambiguity of a finitary context- 
free language. If M is a pushdown automaton accepting finite words by final states (or 
by final states and topmost stack letter) then aj/ £ N or ccm = Hq~ or a^j = Ho. 
However every context-free language is accepted by a pushdown automaton M with 
&M ^ ^o~' IABB961 . We denote the class of non ambiguous context-free languages by 
NA — CFL and the class of inherently ambiguous context-free languages by A — CFL. 
Then one can state the following result. 



Theorem 36 ([Fin03a|). 



NA-CFL^ C co-KC(NA-CFL) 

A-CFL U <£ uj-KC(A-CFL) 

We now come to the study of links between topology and ambiguity in context-free 
w-languages [Fin03a FSQ3]. 

Using a Theorem of Lusin and Novikov, and another theorem of descriptive set theory, 
see l|Kec95l page 123], Simonnet proved the following strong result which shows that 
non-Borel context-free w-languages have a maximum degree of ambiguity. 

Theorem 37 (Simonnet |FS03|). Let L(A) be a context-free uj-language accepted by 
a BPDA A such that L{A) is an analytic but non Borel set. The set of co-words, which 
have 2 N ° accepting runs by A, has cardinality 2 N °. 

On the other hand, it turned out that, informally speaking, the operation A — » A~ con- 
serves globally the degrees of ambiguity of infinitary context-free languages (which are 
unions of a finitary context-free language and of a context-free w-language). Then, start- 
ing from known examples of finitary context-free languages of a given degree of am- 
biguity, are constructed in [Fin03a| some context-free w-languages of any finite Borel 
rank and which are non-ambiguous or of any finite degree of ambiguity or of degree 

No- 
Theorem 38. 

1. For each non negative integer n > 1, there exist Y^-complete non ambiguous 
context-free co-languages A n and Tl^-complete non ambiguous context-free co- 
languages B n . 

2. Let k be an integer > 2 or k — Kq. Then for each integer n > 1, there exist 
H a n -complete context-free co-languages E n (k) and II"- complete context-free co- 
languages F n (k) which are in A{k) — CFL^,, i.e. which are inherently ambiguous 
of degree k. 

Notice that the ^-languages A n and B n are simply those which were constructed in the 
proof of Theorem [24] On the other hand it is easy to see that the BPDA accepting the 
context-free w-language which is Borel of infinite rank, constructed in |Fin03c | using an 
iteration of the operation A — > A~, has an infinite degree of ambiguity. And 1 -counter 
Biichi automata accepting context-free w-languages of any Borel rank of an effecive 
analytic set, constructed via simulation of multicounter automata, may also have a great 
degree of ambiguity. So this left open some questions we shall detail in the last section. 

We indicate now a new result which follows easily from the proof of Theorem [271 
sketched in Section|4]above, see [Fin06a|. Consider an w-language L accepted by a de- 
terministic Muller Turing machine or equivalently by a deterministic 2-counter Muller 
automaton. We get first an w-language 8s(L) C which has the same topological 
complexity (except for finite Wadge degrees), and which is accepted by a deterministic 
real time 8-counter Muller automaton A. 



Then one can construct from A a 1-counter Muller automaton B, reading words over 
the alphabet E U {A, B, 0}, such that h{L{A)) U h{E^)- = L(B) U h(£ u )~ , where 
h : E" — > (E U {A, B, 0})" is the mapping defined in Section @] Notice that the 1- 
counter Muller automaton B which is constructed is now also deterministic. 
On the other hand it is easy to see, from the decomposition given in |Fin06a Proof of 
Lemma 5.3], that the w-language h(E u> )~ is accepted by a 1-counter Biichi automa- 
ton which has degree of ambiguity 2 and the w-language L(B) is in NA — CFL U = 
CFL UJ (a < 1) because it is accepted by a deterministic 1-counter Muller automa- 
ton. Then we can easily infer, using |Fin03a Theorem 5.16 (c)] that the w-language 
h(L(A)) Uh(£ u )- = L(B)Uh{E UJ )~ is in CFL„(a < 3). And this w-language has 
the same complexity as L(A) Thus we can state the following result. 

Theorem 39. For each cu-language L accepted by a deterministic Muller Turing ma- 
chine there is an ui-language L' G CFL u (a < 3), accepted by a 1-counter Muller 
automaton T> with a-p < 3, such that L =w L'. 

7 u;-powers of context-free languages 

The w-powers of finitary languages are cj-languages in the form V u , where V is a 
finitary language over a finite alphabet E. They appear very naturally in the character- 
ization of the class REG^ of regular w-languages (respectively, of the class CFL^ 
of context-free cj-languages) as the w-Kleene closure of the family REG of regular 
finitary languages (respectively, of the family CF of context-free finitary languages) . 
The question of the topological complexity of a>powers naturally arises and was raised 
by Niwinski HNiw90l . Simonnet HSim92l . and Staiger HSta97bl . 

An w-power of a finitary language is always an analytic set because it is either the 
continuous image of a compact set {0, 1, ... , n} u for n > or of the Baire space w". 
The first example of finitary language L such that U° is analytic but not Borel, and even 
S}-complete, was obtained in [Fin03b|. Amazingly the language L was very simple 
and even accepted by a 1-counter automaton. It was obtained via a coding of infinite 
labelled binary trees. 

We now give a simple construction of this language L using the notion of substitution 
which we now recall. A substitution is defined by a mapping / : £ — * V(r*), where 
E = {ai, . . . ,a n } and r are two finite alphabets, / : a% — * Li where for all integers 
i G [1; n], f(cii) = Li is a finitary language over the alphabet r. 
Now this mapping is extended in the usual manner to finite words: /(a^ . . . aj„) = 
L ix . . . L in , and to finitary languages L C £*: f(L) = U xe Lf(x). If for each integer 
i G [1; n] the language Lj does not contain the empty word, then the mapping / may be 
extended to w-words: f(x(l) . . . x{n) . . .) = {u\ . . . u n . . . \ Vi > 1 u, G f{x{i))} 
and to w-languages L C E w by setting f(L) = U xe Lf(x). 

Let now E = {0, 1} and d be a new letter not in E and 



D = {u.d.v | u,v G E* and (\v\ = 2\u\) or (\v\ = 2\u\ + 1) } 



D C (£ U {d})* is a context-free language accepted by a 1-counter automaton. Let 
g : S -> V((S U {d})*) be the substitution defined by g(a) = a.D. As W = 0*1 
is regular, L = g(W) is a context-free language and it is accepted by a 1-counter 
automaton. Moreover one can prove that {g{W)) u is Sj-complete, hence a non Borel 
set. This is done by reducing to this w-language a well-known example of H\ -complete 
set : the set of infinite binary trees labelled in the alphabet {0,1} which have an infinite 
branch in the IT^-complete set (0*.l) ClJ , see [Fin03b| for more details. 

Remark 40. The uj-language (g(W)) UJ is context-free. By Theorem 1771 every BPDA 
accepting (g(W)) u> has the maximum ambiguity and (g(W)) u G A(2^°) — CFL U . 
On the other hand we can prove that g{W) is a non ambiguous context-free language. 
This is used in HFS03V to prove that neither unambiguity nor ambiguity of context-free 
languages are preserved under the operation V — ► V u . 

Concerning Borel w-powers, it has been proved in MFinOlc l that for each integer n > 1, 
there exist some w-powers of context-free languages which are IT" -complete Borel sets. 
These results were obtained by the use of a new operation V — > V~ over w-languages, 
which is a slight modification of the operation V — > V~. The new operation V — > V~ 
conserves w-powers and context-freeness. More precisely if V = W u for some context- 
free language W, then V~ = T w for some context-free language T which is obtained 
from W by application of a given context-free substitution. And it follows easily from 
|Dup01| that if V C S u is a IT" -complete set, for some integer n > 2, then V~ is 
a -complete set. Then, starting from the Ilj-complete set (0*.l) w , we get some 
IT" -complete w-powers of context-free languages for each integer n > 3. 

An iteration of the operation V — > V~ was used in [Fin04| to prove that there exists 
a finitary language V such that is a Borel set of infinite rank. The language V was 
a simple recursive language but it was not context-free. Later, with a modification of 
the construction, using a coding of an infinity of erasers previously defined in [Fin03c|, 
Finkel and Duparc got a context-free language V such that V u is a Borel set above the 
class A° , llDFT)7l . 

The question of the Borel hierarchy of aj-powers of finitary languages has been solved 
very recently by Finkel and Lecomte in [FL07 1, where a very surprising result is proved, 
showing that actually w-powers exhibit a great topological complexity. For every non- 
null countable ordinal a there exist some S^-complete cj-powers and also some 11° - 
complete cj-powers. But the cj-powers constructed in |FL07 1 are not cj-powers of context- 
free languages, except for the case of a Sj-complete set. Notice also that an example of 
a regular language L such that is S°-complete was given by Simonnet in [Sim92|, 
see also IILec05l . 

8 Perspectives and open questions 

We give below a list of some open questions which arise naturally. The problems listed 
here seem important for a better comprehension of context-free w-languages but the list 
is not exhaustive. 



8.1 Effective results 



In the non-deterministic case, the Borel and Wadge hierarchies of context-free w-langua 
ges are not effective, IFinOlc Fin03b FinOld]. This is not surprising since most decision 
problems on context-free languages are undecidable. On the other hand we can expect 
some decidability results in the case of deterministic context-free cj-languages. We have 
already cited some of them : we can decide whether a deterministic context-free u>- 
language is in a given Borel class or even in the Wadge class [L] of a given regular 
cj-language L. The most challenging question in this area would be to find an effecive 
procedure to determine the Wadge degree of an w-language in the class DCFL^. 
Recall that the Wadge hierarchy of the class DCFL^ is determined in a non-effective 
way in | Dup03) . On the other hand the Wadge hierarchy of the class of blind counter 
w-languages is determined in an effective way, using notions of chains and superchains, 
in MFinOlal . There is a gap between the two hierarchies because (blind) 1 -counter au- 
tomata are much less expressive than pushdown automata. One could try to extend the 
methods of |Fin01a| to the study of deterministic pushdown automata. 
Another question concerns the complexity of decidable problems. A first question would 
be the following one. Could we extend the results of Wilke and Yoo to the class of blind 
counter cj-languages, i.e. is the Wadge degree of a blind counter w-language computable 
in polynomial time ? Otherwise what is the complexity of this problem ? Of course the 
question may be further posed for classes of ^-languages which are located between 
the classes of blind counter ^-languages and of deterministic context-free ^-languages. 
Another interesting question would be to determine the Wadge hierarchy of w-languages 
accepted by deterministic higher order pushdown automata (even firstly in a non effec- 
tive way), HEng83|CW07l . 



8.2 Topology and ambiguity 



Simonnet's Theorem [37] states that non-Borel context-free w-languages have a maxi- 
mum degree of ambiguity, i.e. are in the class A(2 Ha ) — CFL U . On the other hand, 
there exist some non-ambiguous context-free w-languages of every finite Borel rank. 
The question naturally arises whether there exist some non-ambiguous context-free uj- 
languages which are Wadge equivalent to any given Borel context-free a>language (or 
equivalently to any Borel Z^-set, by Theorem|28l>. This may be connected to a result of 
Arnold who proved in [Arn83| that every Borel subset of S u , for a finite alphabet E, 
is accepted by a non-ambiguous finitely branching transition system with Biichi accep- 
tance condition. By Theorem|38] if k is an integer > 2 or k = Kq\ then for each integer 
n > 1, there exist S° -complete context-free w-languages E n (k) and II" -complete 
context-free cj-languages F n (k) which are in A(k) — CFL^, i.e. which are inher- 
ently ambiguous of degree k. More generally the question arises : determine the Borel 
ranks and the Wadge degrees of context-free w-languages in classes CFL UJ (a < k) or 
A{k)-CFL UJ where k G N U {K , H , 2 H "} ( k > 2 in the case of A(k) — CFL U ). A 
first result in this direction is Theorem[39l stated in Section [6] 



8.3 u -powers 



The results of [FinOlc Fin03b Fin04 FL07| show that u-powers of finitary languages 
have actually a great topological complexity. Concerning w-powers of context-free lan- 
guages we do not know yet what are all their infinite Borel ranks. However the results 
of flFin06al suggest that w-powers of context-free languages or even of languages ac- 
cepted by 1-counter automata exhibit also a great topological complexity. 
Indeed Theorem [28] states that there are w-languages accepted by Biichi 1-counter au- 
tomata of every Borel rank (and even of every Wadge degree) of an effective analytic 
set. On the other hand each cj-language accepted by a Biichi 1-counter automaton can 
be written as a finite union L = Ui<i< n Ui.Vf, where for each integer i, Ui and V$ are 
finitary languages accepted by 1-counter automata. Then we can conjecture that there 
exist some cj-powers of languages accepted by 1-counter automata which have Borel 
ranks up to the ordinal 73, although these languages are located at the very low level in 
the complexity hierarchy of finitary languages. 

Recall that a finitary language L is a code (respectively, an w-code) if every word of L + 
(respectively, every w-word of L u ) has a unique decomposition in words of L, [BP85 1. 
It is proved in [FS03] that if V is a context-free language such that V u is a non Borel 
set then there are 2 N ° w-words of which have 2 N ° decompositions in words of V; 
in particular, V is really not an w-code although it is proved in [FS03] that V may be a 
code (see the example V=g(W) given in Section[7]i. The following question about Borel 
cj-powers now arises : are there some context-free codes (respectively, w-codes) V such 
that is S° -complete or -complete for a given countable ordinal a < 72 ? 
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